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Abstract — This  paper  summarizes  the  characteristics  of  the 
electricity  load  data  collected  every  15  minutes  of  power  users. 
The  single-day  load  data  of  single  power  user  is  taken  as  a 96- 
length  vector,  and  the  single-day  data  of  n users  is  taken  as  a 
96-column  set.  Single- user  monthly  data  and  annual  data  are 
described  as  a 30  (31)  x 96  matrix  and  a 365  (366)  x 96  matrix 
respectively.  By  comparing  the  wavelet  transforms  of  multiple 
wavelet  basis  functions,  the  Daubechies  4 wavelet  basis 
function  is  chosen  as  the  wavelet  transform  basis  function  of 
the  electricity  load  data  according  to  the  Shannon  entropy  of 
the  obtained  wavelet  coefficients.  The  difference  between  the 
inverse  wavelet  transformed  data  and  the  original  data  is 
compared  under  various  characteristic  wavelet  coefficients. 
Besides,  the  number  of  the  characteristic  wavelet  coefficients 
and  the  error  of  the  wavelet  transform  are  determined.  The 
results  show  that  there  is  a high  redundancy  in  the  original 
power  load  data,  and  the  storage  space  can  be  greatly 
compressed  by  wavelet  transform  to  achieve  feature  extraction 
and  data  desensitization.  Therefore,  the  feature  extraction 
technology  of  power  load  data  based  on  wavelet  transform  in 
this  paper  has  potential  application  value  and  popularization 
value,  and  can  produce  high  economic  efficiency. 

Keywords-wavelet  transform;  wavelet  multi-resolution 
decomposition;  electric  information  acquisition  data 

I.  Introduction 

At  present,  the  load  data  of  power  users  has  become  an 
important  data  source  for  government  to  carry  out  smart  city 
construction  and  for  telecom  operators,  Internet  companies, 
finance  and  insurance  industry  to  make  customer  portraits,  as 
well  as  for  retail  entertainment  enterprises  to  assist 
commercial  location.  However,  due  to  the  large  number  of 
meters,  the  high  sampling  frequency  and  the  huge  data 
amount,  a network  with  large  storage  capacity,  high 
bandwidth  and  high  computing  performance  is  in  great  need 
to  perform  data  handle,  storage,  process  and  analysis. 
Therefore,  a high  precision  feature  extraction  of  load  data  in 
application  of  data  mining  algorithm  is  actually  in  urgent 
need. 

At  present,  the  methods  of  data  characterization  are 
mainly  based  on  three  categories:  statistics,  model  and 
transformation.  Among  which,  the  characterization  based  on 
transformation  refers  to  the  highlight  of  classification 
characteristic  by  means  of  transfonnation,  including  time- 
frequency  transformation  and  linear  transfonnation  [1-3]. 
Representative  time-frequency  conversion  methods  include 
fast  Fourier  transfonn,  short-time  Fourier  transfonn  and 


cepstral  coefficients.  The  classic  linear  transfonnation  is 
principal  component  analysis  (PCA),  singular  value 
decomposition  (SVD),  linear  discriminant  analysis,  K-L 
transfonn,  wavelet  transfonn  and  wavelet  packet  technology 
[4-6]. 

The  generalized  empirical  wavelet  transfonn  is  used  to 
reproduce  the  single  and  composite  power  quality 
distribution,  which  has  a good  effect  on  the  precise 
classification  of  power  quality  [7].  The  time  series  is 
transfonned  from  time  domain  to  frequency  domain  through 
signal  processing,  and  the  finite  time  series  in  the  frequency 
domain  is  employed  to  approximately  describe  the  original 
time  series  [8].  In  paper  [9],  an  approximate  optimal 
histogram  construction  algorithm  based  on  workload  is 
proposed,  a method  of  calculating  the  unselected  wavelet 
coefficients  under  the  L2  error  metric  and  min-max  error 
metric  is  presented,  and  an  improved  table  semantic 
aggregation  method  and  that  of  graph  is  studied. 

However,  whether  the  wavelet  transfonn  is  suitable  for 
the  feature  extraction  of  the  electricity  load  data,  whether  can 
it  ensure  the  characteristic  can  indicate  users'  power 
consumption  trend,  and  whether  the  detailed  electricity 
infonnation  can  be  expressed  by  means  of  flexible  selection 
of  the  characteristic  coefficients  are  all  under  consideration. 

In  general,  this  paper  studies  the  structure  feature  of 
power  users’  electricity  consumption  behavior,  and  selects 
the  appropriate  wavelet  basis  function  to  carry  out  multi- 
resolution decomposition  of  electricity  load  data.  Thus  the 
electrical  infonnation  characteristics  of  each  power  user  are 
extracted  to  achieve  data  compression  and  data 
desensitization.  In  addition,  this  paper  analyzes  the  loss  of 
the  original  data  caused  by  wavelet  transfonn  through  the 
inverse  wavelet  transfonn  of  the  characteristic  value  of 
electricity  infonnation,  thus  the  feasibility  of  the  feature 
extraction  method  studied  above  is  evaluated. 

II.  Extraction  and  Selection  of  Wavelet  Feature 
A.  Wavelet  Transform 

In  application  of  Wavelet  Transfonn  (WT)  to  analyze  the 
original  signal,  the  dual  localization  property  in  both  time 
domain  and  frequency  domain  could  be  achieved.  The 
conesponding  sampling  step  is  applied  for  different  signals 
in  time  domain,  so  as  to  focus  on  any  slight  detail  of  the 
signal.  WT  is  considered  to  achieve  better  effect  than  Fourier 
Transfonn  and  windowed  Fourier  Transfonn,  and  a 
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breakthrough  of  modern  Fourier  analysis.  Besides,  it  is 
praised  as  “the  mathematical  microscope”  [10-13]. 

As  with  a square  integrable  function  y/(t)  , if 

y/(t)  dt  = 0 , then  y/(t)  is  called  a wavelet.  Typically  a set 

■QO 

of  wavelets  scaled  and  translated  from  single  wavelet 
function  is  applied  during  a wavelet  transform.  As  defined  by 

Va  b(t)  = lal2V/(~)  ’ (a>  b^R’  (!) 

where  a is  the  scale  parameter,  b is  the  translation  parameter 
and  yj  is  called  the  basic  wavelet  or  the  mother  wavelet. 

Wavelet  transform  can  be  divided  into  two  types, 
Continue  Wavelet  Transform  (CWT)  and  Discrete  Wavelet 
Transform  (DWT).  CWT  is  commonly  employed  for 
theoretical  research  and  DWT  whose  scale  variables  and 
translated  variables  both  have  been  discretized  is  more 
frequently  used  in  practical  application. 

Define  a = <J0,  b = kb0d(p  a0>l , b0>0,  where  the  values 
of  a0  and  b0  depend  on  the  specific  form  of  the  mother 
wavelet,  j and  k are  integers.  Thus,  the  discrete  wavelet  is 
defined  as: 


Let  Vj  denotes  all  the  function  spaces  that  is  constant  at 
1/2  j interval,  which  are  all  vector  spaces.  The  scale  function 

{cp.  k=  0,  1,  ...,  2/-l}  is  employed  to  constitute  a base  of 
Vj , and  the  satisfied  nest  relation  is  as  shown 

V0cV1c...cVj<=VJ+1cz...  (6) 

We  define  a function  of  W/  shown  as 
Wj={h^  Vj+i’.  (/?,y)=0,  V/e  Vj}  as  orthogonal  complement 
of  Vj  in  Vj+ 1,  which  means  VJ+1=Vj  ® Wp  in  which  Wf  is  the 
lost  detailed  information  when  replacing  Vj+\  with  Vj,  where 
the  former  indicates  a function  space  at  high  resolution  and 
the  latter  indicates  that  at  low  resolution.  Thus,  for  any  non- 
negative integer  n > 0,  the  space  decomposition  of  Vn+]  is 
shown  as 

vn+l=vn  © wn 

= VnA  0 WnA@  Wn=.=V0  ®W0®W{  ©...©  Wn  (7) 

The  multi-resolution  analysis  of  any  discrete  sequence 
f„+l  e V„+i  is  as  modeled 


V,  k(0  = ~F=  V( J1 ~ )=4/2vV0t  - kb0 ) (2) 

J 5 J ao 

Jo 

The  corresponding  discrete  wavelet  transfonn  is  as 
defined  by, 

= (Jit),  yr  k(t))  = af2  Vftt)  V(ax>t-  kb0)dt  (3) 

If  there  are  positive  numbers  A and  B , for  0 <A^B<  go 
and  V f{x)  ^ L2{R),  if  there  is 

All/ll2  <Sy  Si:  (fVjj)\  <B\\f\\2  j,k<EZ  (4) 

Then  {lFj  ^(x)}  is  called  a wavelet  framework  of  L2(R ), 
And  B are  called  frame  bounds.  B<o o guarantee  that  the 
transform  f ->  {(f  y/  , ) } is  continuous  while  A> 0 ensures 

that  the  transfonn  is  reversible  and  continuously  inverse. 

If  the  discrete  wavelet  sequence  { lFj  ^(x)}  constitutes  a 
wavelet  frame,  then  there  must  be  a corresponding  dual 
sequence  { ^ a(T)}5  so  that / (x)  could  simply  be  expressed  as 

f(x)  = HjJAx),  VjJx))  Vj,k(x)=Zji:k  WTf{j,  kfx  k(x)  (5) 

B.  Multi- Resolution  Decomposition 

The  resolution  is  low  under  large  WT  scale,  which  is 
suitable  for  general  analysis  while  that  is  high  under  small 
scale,  suitable  for  detailed  observation.  However,  the  quality 
factor  (the  ratio  of  the  center  frequency  and  the  bandwidth) 
analyzed  at  different  scales  remain  unchanged.  The  step-by- 
step  analysis  of  signals  is  called  multi-resolution 
decomposition  analysis  [14]. 


fn+j-fn+d„-fn_l+dn_l+dn-...-f0+d0+dl+...+d„  (8) 

where 

f=  Sto  vj,  kVjj  k^Vj,j=0,l,...,n+l 

dj=lVo  wJ,kVj,k  e Wj,  j-0,  1,  n+\  , vjM  is  the  scale 

coefficient  and  Wj^  is  the  wavelet  coefficient  respectively. 

C.  Selection  of  Wavelet  Basis  Function 

The  selection  of  wavelet  basis  function  is  of  vital 
importance  during  data  process  based  on  WT.  If  the  basis 
function  and  the  original  signal  have  a certain  similarity,  a 
better  conversion  effect  could  be  achieved.  The  minimized 
concave  cost  function  is  basically  employed  to  pick  the  best 
base  from  the  base  dictionary.  In  this  way,  the  tools  in  the 
classical  harmonic  analysis  like  gate  valve  function  and 
Shannon  entropy  could  be  used  to  measure  the  similarity 
between  the  base  function  and  the  original  signal. 

In  this  paper,  the  optimal  wavelet  basis  function  of 
electricity  load  data  is  selected  based  on  the  Shannon  entropy 
minimum  principle  of  wavelet  decomposition  coefficient. 
During  the  decomposition,  if  the  coefficient  contains  the 
smallest  entropy,  then  the  selected  basis  function  is  the 
optimal  wavelet  basis  function.  The  wavelet  coefficients  are 
expressed  as  {&>,},  and  Shannon  entropy  is  used  to  select  the 
cost  function  of  wavelet  basis  function,  as  defined  by 

H(co)=  -XMI2log2KI2  (9) 

D.  Extraction  of  Feature  Wavelet  Coefficients 

1)  Standardization  of  wavelet  coefficients 
Data  characterization  devotes  to  the  simplification  of  data 
collection  and  transmission,  data  compression  and  data 
desensitization,  which  maximizes  the  intrinsic  value  of  the 
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data.  As  a result,  during  feature  extraction  of  electricity 
information  based  on  WT,  the  data  requirement  of  both 
internal  business  scene  and  external  scene  should  be  satisfied. 
Thus  not  only  can  useful  information  for  business  analysis  be 
maximally  kept,  but  also  the  storage  space  of  the  original 
data  could  be  compressed  to  the  most. 

In  this  paper,  to  confirm  the  feature  wavelet  coefficients, 
initially  we  can  sort  all  the  wavelet  coefficients  according  to 
their  absolute  values,  then  detennine  an  optimal  feature 
wavelet  coefficient  retention  number  B,  after  which  the  B 
wavelet  coefficients  with  larger  absolute  values  are  taken  as 
the  feature  coefficients  of  the  original  data,  and  the 
remaining  wavelet  coefficients  are  set  to  zero. 

For  any  wavelet  coefficient,  the  closer  it  is  to  zero,  it  will 
lose  less  information  when  set  to  zero.  Besides,  the 
information  loss  is  also  affected  by  the  level  of  multi- 
resolution decomposition.  For  the  above  reasons, 
standardization  of  wavelet  coefficients  is  essential,  which  is 
modeled  as 

Wcill[i\*=  Wall[i\/sj2leveliWam  (10) 

where  level(x)  indicates  the  level  of  wavelet  coefficient  x in 
multi-resolution  decomposition,  Wall[i ] indicates  the  i-th 
wavelet  coefficient  and  Wall[i ]*  indicates  the  i-th 
standardized  wavelet  coefficient. 

2)  Selection  of  wavelet  coefficients 
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/ Confirmation  of  the 

optimal  B ) 

Figure  1.  selection  of  wavelet  coefficients 

From  what  studies  above,  the  determination  of  retained 
feature  wavelet  coefficients  number  B directly  affects  how 
much  the  wavelet  coefficients  contain  the  information  of  the 
original  data,  as  well  as  determines  the  compression  ratio  of 


the  original  data  after  data  characterization.  Thus,  the 
determination  of  the  B value  is  the  most  critical  step  in  data 
characterization  based  on  wavelet  transfonn. 

In  this  paper,  through  the  analysis  of  the  error  between 
the  data  obtained  from  inverse  wavelet  transfonn  and  the 
original  data,  the  optimal  value  of  B is  detennined,  so  as  to 
achieve  transmission  simplification,  data  compression  and 
data  desensitization  while  ensuring  the  demand  of  business 
analysis.  Detailed  steps  are  shown  in  Fig.  1,  when  different 
numbers  of  feature  wavelet  coefficients  are  retained,  the 
errors  between  the  corresponding  inverse  wavelet  transfonn 
data  and  the  original  data  are  calculated.  Through  the 
analysis  of  the  transfonnation  of  the  error,  we  can  detennine 
the  optimal  retain  number  of  feature  wavelet  coefficients  B. 

III.  Experimental  Analysis  of  Electricity  Load 

Data 

The  electricity  load  data  studied  in  this  paper  comes  from 
collection  of  96  monitoring  points  per  day.  Indicator  data  for 
single  user  per  day  can  be  regarded  as  a 96 -length  vector, 
and  data  of  n users  per  day  could  be  formed  as  a data  set  with 
n rows  and  96  columns,  each  line  of  the  data  set  could 
indicates  a single  user.  For  indicator  data  of  single  user 
multi-days,  96  monitor  points  per  day  can  be  used  as  96 
variables,  monthly  data  is  stored  asa30  (31)x96  matrix  and 
the  annual  data  stored  as  a 365  x 96  matrix.  In  this  paper,  the 
monthly  (30  days)  load  data  of  1000  users  are  randomly 
selected  from  the  power  user  information  acquisition  system 
as  samples. 

A.  Selection  of  Wavelet  Basis  Function 

Shannon  entropy  of  the  wavelet  coefficients  obtained 
from  different  wavelet  basis  functions  is  as  shown  in  Fig.2.  It 
can  be  seen  from  the  figure  that,  for  single-user  multi-day 
electricity  load  data,  the  Shannon  entropy  of  the  obtained 
wavelet  coefficients  is  the  smallest  when  d4  (Daubechies  4) 
is  the  basis  function  of  wavelet  transform;  for  single-user 
single-day  data,  the  Shannon  entropy  of  the  obtained  wavelet 
coefficients  is  small  when  la20  (Least  Asymetric  20),  bl20 
(Best  Localized  20)  and  d4  are  the  basis  functions.  Since  the 
d4  wavelet  basis  function  is  similar  to  the  structural 
characteristics  of  the  electricity  load  data,  this  paper  takes  the 
calculation  results  of  Shannon  entropy  into  account, 
combines  the  discrete  structure  of  electricity  load  data  to 
select  the  d4  function  as  the  basis  function  of  the 
decomposition  of  electricity  load  data. 
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Figure  2.  Shannon  entropy  of  wavelet  coefficients  obtained  by  using 

different  wavelet  basis  functions 
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In  this  paper,  d4  wavelet  basis  function  is  selected  for 
single-user  single-day  data  and  single-user  multi-day  data  to 
take  wavelet  multi-resolution  decomposition  according  to  the 
structural  characteristics  of  electricity  load  data.  As  shown 
in  Fig.  3,  the  higher  the  decomposition  stage  is,  the  more 
detailed  changes  of  the  original  data  from  both  the  frequency 
domain  and  the  time  domain. 
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Figure  3.  wavelet  multi-resolution  decomposition  example 


B.  Error  Analysis 

The  square  sum  of  the  error  between  data  obtained  from 
wavelet  transfonn  and  the  original  data  is  taken  as  the  error 
of  data  characterization  in  this  paper,  modeled  as 


Errsingle  = £;=6i  (y.  - xj)  (11) 

Err muitiPie  = £'L,  Z/1  Oj  - Xj)2  ( 1 2) 


In  Equation  (11)  Errswgte  represents  the  error  for  single- 
user  single-day  while  in  Equation  (12)  Errmuitipie  represents 
the  error  for  single-user  multi-day.  n indicates  the 
observation  days  of  single-user  multi-day,  indicates  the 
sequence  data  obtained  by  inverse  wavelet  transfonn,  and 
represents  the  original  sequence  data. 

The  error  between  the  inverse  transfonned  data  under 
different  B and  the  original  data  for  1 000  users  per  day  is  as 
shown  in  Fig.  4 and  that  for  1000  users  per  month  is  shown 
in  Fig.  5. 
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Figure  4.  the  error  for  multi -users  per  day 
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Figure  5.  the  error  for  multi-users  multi-days 

In  order  to  intuitively  show  the  difference  between  the 
wavelet  transfonned  data  with  the  original  data,  Fig.  6 takes 
a typical  user’s  load  curve  as  an  example.  When  B is  settled 
as  0,  10,  50  and  93,  the  difference  between  the  inverse 
wavelet  transfonned  sequence  data  and  the  original  data  is 
shown  in  Fig. 6. 

It  is  shown  in  Fig.4  and  Fig.6  that  the  mean  value  of  enor 
decreases  obviously  with  the  increasing  value  of  B when  B is 
less  than  50,  and  the  mean  value  of  single-user  multi-day 
data  does  not  exceed  0.15  and  increasing  the  number  of 
characteristic  wavelet  coefficients  can  not  significantly 
reduce  the  error  between  the  inverse  wavelet  transfonn 
sequence  and  the  original  sequence  data  when  B is  not  less 
than  50,  which  proves  that  there  is  a high  redundancy  in  the 
original  power  load  data  and  it  is  necessary  to  characterize 
the  data. 
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Figure  6.  the  difference  between  the  inverse  wavelet  transformed 
sequence  data  and  the  original  data  under  different  5 

IV.  Conclusions 

With  the  rapid  growth  of  global  data,  information  of  grid 
enterprise  is  also  growing  rapidly,  which  causes  great  trouble 
in  data  collection,  transformation,  storage  and  application. 
The  massive  accumulated  data  increase  grid  enterprise 
operation  cost.  Therefore,  data  transformation  simplification, 
data  compression  and  data  desensitization  have  become 
important  issues  of  grid  big  data  application. 
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This  paper  studies  the  basic  principle  of  wavelet 
transform  and  analyzes  the  application  value  of  multi- 
resolution  decomposition  of  data  characterization.  It  turns 
out  that  the  application  of  wavelet  transform  in  electricity 
load  data  for  data  characterization  achieves  great  effect, 
including  lower  data  storage  space,  more  simplified 
transmission  and  improved  analysis  efficiency.  However,  the 
selections  of  basis  function  and  wavelet  coefficients  as 
characterized  coefficients  are  of  vital  importance.  It  will  not 
only  affect  data  compression,  but  also  influences  the 
business  analysis  result. 

The  Shannon  entropies  of  the  wavelet  coefficients 
obtained  from  25  kinds  of  wavelet  basis  functions  are 
compared,  and  d4  function  is  chosen  as  the  basis  function 
based  on  the  minimum  entropy  principle.  In  addition,  the 
inverse  wavelet  transform  is  applied  to  characteristic  wavelet 
coefficients,  and  the  error  between  the  result  and  the  original 
data  is  calculated,  through  the  analysis  it  is  proved  that  the 
d4  wavelet  transform  can  be  used  for  data  characterization  to 
achieve  lower  data  storage,  more  simplified  data 
transmission  and  improved  analysis  efficiency. 

Data  characterization  based  on  wavelet  transform  for 
single-user  single-day  data  and  single-user  multi-day  data 
achieves  great  effect  on  data  compression  and  data 
desensitization,  which  makes  it  possible  for  big  data  in 
power  field  not  only  used  within  power  enterprises,  but  also 
combine  with  external  data  and  serve  for  more  fields.  Thus 
the  potential  value  of  power  data  could  be  reflected. 
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